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1 Summary of Results 



In their fundamental work Paul Erdos and Alfred Renyi [1] considered the 
evolution of the random graph G{n, p) as p "evolved" from to 1. At p = 1/n 
a sudden and dramatic change takes place in G. When p = c/n with c < 1 
the random G consists of small components, the largest of size 0(logn). 
But by p = c/n with c > 1 many of the components have "congealed" into 
a "giant component" of size G(n). Erdos and Renyi called this the double 

^N I jump, the terms phase transition (from the analogy to percolation) and Big 

(<— ^ ' Bang have also been proferred. 

Tj" . Now imagine an observer who can only see G through a logical fog. He 

^^ I may refer to graph theoretic properties A within a limited logical language. 

T^ ' Will he be able to detect the double jump? The answer depends on the 

d • strength of the language. Our rough answer to this rough question is: the 

double jump is not detectible in the First Order Theory of Graphs but it is 
detectible in the Second Order Monadic Theory of Graphs. These theories 
will be described below. We use the abbreviations fotog and somtog for these 
two theories respectively. 



C^ \ For any property A and any c > we define 

/(c) = /a(c) = Jim Pr[G(n, c/n) [= A] 

Here G \= A means that G satisfies property A. Beware, however, that we 
cannot presuppose the existence of /(c) as the limit might not exist. 
Theorem 1. Let A be a fotog sentence. The fA{c) exists for all c and /^ 
is an infinitely differentiable function. Moreover Ja belongs to the minimal 
family of functions J- which contain the functions 0, 1 and c, are closed under 



addition, subtraction, and multiplication by rationals and are closed under 
base e exponentiation so that / G ^ =^ e-^ € ^. 

Examples. Let A be "there exists a triangle". Then /(c) = e^'^' '^. Let B 
be "there exists an isolated triangle", i.e., a triangle with none of the three 
vertices adjacent to any other vertices besides themselves. Then 



fsic) 
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Finally, call a triangle x, y, z unspiked if there is no point w which is adjacent 
to exactly one of x, y, z and no other point. Let C be the property that there 
is no unspiked triangle. Then 



fcic) = e~ 



^clg-clce /g 



Since 1 is not a special value for these / we say, roughly, that the double 
jump is not detectible in fotog. Our remaining results all concern somtog. 
Theorem 2. There is a somtog A with 

„ , , / if c < 1 
^^(^) = \ 1 ifol 

Theorem 3. For all somtog A and c < 1 the value /yi(c) is well defined. 

Theorem 4. For all cq > 1 there is a somtog A with /yi(c) not defined for 

c > Co. 

Theorem 5. For any c < 1 and e > there is a decision procedure that will 

determine /yi(c) within e for any somtog A. 

Theorem 6. Let c > 1. Then there is no decision procedure that separates 

the somtog A with fAic) = from those with fAic) = 1. 

Certainly the situation with c=l is most interesting but we do not dis- 
cuss it in this paper. 

Description of Theories: The First Order Theory of Graphs (fotog) con- 
sists of an infinite number of variable symbols (x, y,z . . .), equality (x = y) 
and adjacency (denoted x ^ y) symbols, the usual Boolean connectives 
(a, V, -1 . . .) and universal and existential quantification (V^;, 3^^, . . .) over the 
variables which represent vertices of a graph. Second Order Monadic Theory 
of Graphs [somtog) also includes an infinite number of set symbols(S', T,U . . .) 
which represent subsets of the vertices and membership (g) between vertices 
and sets (x £ S). The set symbols may be quantified over {^Si^T ■ ■ ■) as 
well as the variables. As an example, in fotog we may write 

'ix'^y^z^wix ^zAz^wAw^-^y] 



which means that all pairs of vertices are joined by a path of length three. 
However, one cannot say in fotog that the graph is connected. In somtog 
we define path(x, y, S) to be that x,y (z S and every z G S is adjacent to 
precisely two other w G S except for x and y which are each adjacent to 
precisely one point of S. This has the interpretation that S gives an induced 
path between x and y. The statement 3spath{x, y, S) holds if and only if x 
and y lie in the same component since if they do a minimal path S would 
be an induced path. We write conn{x, y, R) for 3sS C R A path{x, y, S) 
which means that in the restriction to R, x and y lie in the same compo- 
nent. The property "G is connected" is represented by the somtog sentence 
'\/x'^y3sP0'th{x,y, S). This ability to express x and y being joined by some 
path of arbitrary size seems to give the essential strength of somtog over 
fotog. Now we can prove Theorem 2. Let circ{S) be the sentence that S is 
connected and that every v € S is adjacent to precisely two w G S. Consider 
the sentence 

A : 3s,T,Rcirc{S) Acirc{T) ASnT = 0AS' C RAT C RAya:,yeRConn{x,y,R) 

This has the interpretation that the graph contains a component (R) with 
two disjoint circuits. For this A it is well known that /a(c) = when c < 1 
and /a(c) = 1 when c > 1. 

2 The First Order World 

The results of this section were done independently and in more complete 
detail in Lynch [1]. Here we attempt to give a more impressionistic picture 
of G{n, c/n) through First Order glasses. 

What does G(n,p) look like? To begin with, there are lots of trees. 
More precisely, for any tree T and any r there are almost surely more than 
r copies of T as components of the graph. (This includes the trivial case 
where T is a single vertex.) What about more complicated structures. Let 
B(x,R) denote the set of vertices within distance R of x, where we use 
shortest path as the metric. The veracity of a fotog A depends only on 
the values B{x,R) for a fixed (dependent on A) R. (This is most certainly 
not the case for somtog.) For any fixed c and R a. a. all B(x,R) will be 
either trees or unicylic graphs. (For c > l,G{n,c/n) will have many cycles 
in the giant component but they will be far apart.) To make things a bit 
bigger let's define H = TIr to be the set of graphs H consisting of a cycle 
of size at most R and trees of depth at most R rooted at each vertex of the 



cycle. For C C G let B{C,R) denote the set of vertices within distance R 
of some X £ C. For any H (^TC let Xh denote the number of cycles C with 
B{C,R) = i^.When H has v vertices, and hence v edges, with w vertices 
not at depth R 

E[Xh] = r)p"(l -P)""/ I AutiH) h A^ 

where 

A^ = c^e-'"yvl I Aut{H) \ 

Moreover, the Xjj act as independent Poisson distributions in that for any 
Hi,...HsGn 

s 

limn-.ooPr[XH, =Ci,l<i<s\ = \{ \%e-^^^lci\ 

The values B{C, R) can be generated as follows. For each 3 < i < R the 
number of i-cycles is Poisson with mean 0^/21. We generate these cycles and 
then from each vertex generate a pure birth process with Poisson c births. 

To check the veracity of a fotog A one needs only examine B{x,R) and 
further one doesn't need to be able to count higher than R. Lets say two 
numbers are i?-same if they are either equal or both at least R. Say two 
rooted trees of depth 1 are i?-equivalent if the degrees of their roots are 
i?-same. Clearly there are R + 1 equivalence classes. Suppose by induction 
/^-equivalent has been defined on rooted trees of depth i. A rooted tree of 
depth i + 1 may be identified naturally with a set of rooted trees of depth i. 
We call two rooted trees of depth i + li?-equivalent if every i?-equivalence 
class of rooted trees of depth i appears the i?-same number of times as a 
subtree. We say H,H' £ 7i are i?-equivalent if their cycles are the same 
size and the vertices can be ordered about the cycles so that the rooted 
trees emenating from the corresponding vertices are i?-equivalent. We say 
graphs G, G' are i?-equivalent if every equivalence class oi H £ Ti appears 
the i?-same number of times as a B{G^ R). 

We use the following result about fotog. For every fotog A there is an 
R with the following property. Let G, G' be i?-equivalent, both with all 
B(x,R) either trees or unicylic. Suppose further that every i?-equivalence 
class of rooted trees of depth R appears at least R times as a B{x,R) in 
both graphs. Then 

G^A-^^G'^A 



With R fixed and c fixed the rooted trees above certainly appear. By 
induction on the depth we may show that every /^-equivalence class oi H £ TC 
appears Poisson A(c) times in G{n, c) where \ £ J-. Each i?-equivalence class 
of G occurs with probability /(c) € J-. Then i-*r[^] is the finite sum of such 
/(c) and is also in T. 

3 The Second Order Monadic World Before the 
Double Jump 

Here we prove Theorem 5 and hence the weaker Theorem 3. Fix c > and 
e > 0. There are K, L so that with probability at least 1 — e all components of 
G(n, c/n) are either trees or unicylic components of size at most K, and there 
are less than L of the latter. Each possible family of unicyclic components 
holds with a calculatable limit probability. Knowing the precise unicyclic 
components and that G(n, c/n) has "all" trees and no other components 
determines the veracity of A. Hence Pr[A] is determined within e. 

4 The Second Order Monadic World After the 
Double Jump 

We prove Theorem 6 by a reduction to the Trakhtenbrot-Vought Theorem, 
which states that there is no decision procedure which separates those fotog 
A which hold for some finite graph from those which do not. By a clean 
topological Tk{CTKk) in a graph G we mean an induced subgraph consisting 
of k vertices, one path between every pair of points, and nothing else. In §5 
we show for all c > 1 and all integers k that G(n, c/n) almost surely contains 
a CTK]^. For any fotog A we define a somtog A'^ of the form 

A+ : 3s,T,uclean{S, T) A A* 

Here clean{S, T) represents that S is the vertex set of a CTKk on T. That 

is 

(i) SCT 

(ii) Every x,y £ S have a unique T^^y C T with part{x, y, T^^y) and Tx^yCiS = 

{x,y}- 

(iii) There is no edge between any T^^y and T^^y' except at the endpoints. 

Now we transform A to A* by 
(i) replacing V^; and 3y by ^^es and 3y^s 
(ii) replacing a; ~ y by T^^y (lU ^ ^, with T^^y defined above. 



If A holds for no finite graph then A'^ holds for no finite graph. Suppose 
A holds for a finite graph H on, say, k vertices 1 . . . k. Almost surely G 
contains a CTK^ on vertices T with endpoints S. Label S by xi . . . x^ 
arbitrarily. Let U consist of one vertex from T^j,^ (not an endpoint) for 
each pair {xi,Xj} with {i,j} € H. Then A* holds. That is, A~^ holds a. a. 

A decision procedure that could separate somtog B with /b(c) = 1 from 
those with fsic) = could, when applied to i? = A'^ , be used to determine 
if A held for some finite graph, and this would contradict the Trakhtenbrot- 
Vought Theorem. 

Nonconvergence. To prove Theorem 4 we use a somewhat complicated 
graph. Let ki be a positive real and K a positive integer, (ki = 5,K = 100 
is a good example.) We define, for all sufficiently large n, a graph H = 
H{ki,K, n). Let w be the nearest integer to ki logn divisible by K (a tech- 
nical convenience) so that w ^ ki log n. (Asymptotics are in n for fixed 
ki,K. Begin with two points SO, SI and three vertex disjoint paths, each 
of length w, between them. Call this graph H~ . Let AR (which stands for 
arithmetizable) consist of every i^-th vertex on each of the paths, excluding 
the endpoints. Thus AR will have ^ — 1 points from each path, a total 
of wi = 3[^ — 1] points. Order the three paths arbitrarily and order the 
points of AR on a path from 5*0 to 5*1 so that the points of AR are labelled 
1,. . . ,wi. Now, using this labelling, between every pair i,2i add a path of 
length w. (These paths all use new vertices with no additional adjacencies.) 
Now between every pair i,2* add a path of length w. Now between every 
pair i, tower{i) add a path of length w. (The function tower{i) is defined 
inductively by tower{\) = 2,tower{i + 1) = 2*°"'^'"(*).) Finally between ev- 
ery pair i,wow{i) add a path of length w. (The function wow{i) is defined 
inductively by wow{i) = 2, wow{i + 1) = tower{wow{i)).) This completes 
the description of the graph H = H{ki,K, n). 

In §5 we prove that for every c > 1 there exist ki,K so that G{n,c/n) 
almost surely contains an induced copy oi H = H{ki,K,n). We assume 
that here, and with H in mind construct a somtog sentence A = Ak which 
shows nonconvergence. 

The sentence A = Ak will be built up in stages. First we say there 
exist vertices 50,5*1 and sets Pi,P2,P3 so that each Pj gives a path from 
50 to 51, the Pi overlap only at 50,51, and there are no edges between 
Pi and Pj except at the endpoints. Second we say there exists a set AR C 
Pi U P2 U P3 — {50, 51} so that for any path xi • • • xk in any Pj that AR 
contains exactly one of the xi, . . . , xk- (Here the sentence depends on the 
choice of the fixed integer K.) We define an auxilliary binary relation < on 



AR. If x,y £ AR n Pi we define x < y hy the existence of a subset of Pi 
wliich is a path from 5*0 to x which does not contain y. If x G AR n Pi and 
y E AR n Pj with i ^ j we define a; < y to be i = 1, j = 2 or i = 1, j = 3 or 
i = 2,j = 3. On AR we define the auxilfiary binary predicate next{i,j) by 
i < j and there does not exist k G AR with i < /c and k < j. We define the 
unary predicate ONE{i) by i G ^i? and there is no j < i and TWO{i) by 
i G ^i? and j < i ^-^ ONE{j). We say there are unique i,j with ONE{i), 
TWO{j). For convenience we write 1,2 for these elements henceforth. 

Now to arithmetize AR. We say there exists vertex sets DOUBLE, 
EXP, TOWER and WOW. We define auxilhary binary predicate double 
on AR by double{x, y) if x < y and there is a path from z to y in DOUBLE; 
and we similarly define binary predicates exp, tower and tt;oti;. We say 
double{l,2) and double{x,y) Ci double{x , z) -^ y = z and double{x,y) fl 
next(x,xi) n next{y,yi) fl next{yi,y2) —>■ double{xi,y2) and if double{x,y) 
and next(x,xi) and there do not exist yi,y2 with next{y,yi) D next{yi,y2) 
then there does not exist z with double{xi,z) and finally if double{x,y) 
and x' < X then there exists y' with double{x' ,y'). We say exp{l,2) and 
exp{x, y) n exp{x, z) ^ y = z and exp(x, y) fl next{x, xi) n double{y, yi) -^ 
exp{xi,yi) and if exp{x,y) and next(x,xi) and there does not exist yi 
with double{y,yi) then there does not exist z with exp{xi,z) and finally 
if exp{x,y) and x' < x then there exists y' with exp{x',y'). The properties 
for tower are in terms of exp exactly as the properties for exp were in terms 
of double and the properties for wow are in terms of tower in the same way. 

On AR we define unary predicates even{x) by 3ydouble{y, x) and invwow{x) 
by there existing y with wow{x,y) but for all x' > x there do not exist y' 
with wow{x' ,y'). The sentence A = yl/^: concludes by saying there exists x 
with even{x) fl invwow{x). 

Now we show that limPr[G(n,p) |= yl;^] does not exist, moreover that 
the limsup is one and the liminf is zero. On the integers define wow~^{y) 
to be the biggest integer x with wow{x) < y. First let n — >■ oo through 
that subsequence for which wow~^{wi) is even. (Recall wi = O(logn) was 
the size of AR.) Suppose G{n,p) contains an induced copy of H. {ki,K 
depend only on c and so are already fixed.) On H there do exist the vertices 
Si), SI, the sets Pi,P2,P3, AR, DOUBLE, EXP, TOWER with all the 
properties of A^. (Indeed, Ak was created with that in mind.) Under 
the labelling l,...,wi the predicates double,... correspond to the actual 
numbertheoretic predicates and the x = wow^^{wi) has invwow{x) and 
even{x) so Ak holds. But G{n,p) contains an induced copy of H almost 
surely so the limiting probability on this subsequence is one. 

7 



In the other direction, let n go to infinity through a subsequence with 
the property that for ah m with (leaving some room) log log n < m < n the 
value wow~^{m) is odd. (Such n exist since wow—1 is constant for such 
a long time.) Here is the crucial random graph fact: There is a 5 = 5{c) 
so that in G{n, c/n) almost surely all subconfigurations consisting of two 
vertices and three paths between them have size at least 6 log n. (This uses 
a simple expectation argument. The number of configurations of t vertices 
and t + 1 edges giving the above graph is 0{n^p^'^^) = 0{c*'^^ /n) = o(l) 
when t < 6logn.) Thus almost surely any AR that satisfies the conditions of 
Ak will have \AR\ = m > 6'\ogn > log log n. The conditions on double, . . . 
force AR to be arithmeticized so that 3xinvwow{x) r)even{x) will not occur 
when wow~^{m) is odd. Thus almost surely Ak will not be satisfied. 

5 A Variance Calculation 

We fix c > 1, set p = c/n and let G ~ G{n,p). We consider a graph 
H = H{ki,K,n) as defined in §4. We give a description of H suitable for 
our purposes. Set w = \ki log n\ . Take two vertices and draw three vertex 
disjoint paths each of length w. This gives a graph H^. On H~ a set of 
pairs of vertices {a, a'} are specified, no a lying in more than eight such 
pairs. We let I denote the precise number of such pairs so that / ~ elogn. 
By making K large we can make e as small as desired. Between each such 
pair a path of length w is placed with new vertices. This gives the graph H. 
It has V = 0(ln n) vertices and e edges where e = 3w + lw ^ eki log n and 
e — v = l + l^e log n. Let us denote the vertices oi H hj 1, . . . ,v. 

5.1 The Second Moment Method 

Our object in this section is to show that, for appropriate ki,e, the random 
G{n,p) almost surely contains an induced copy of H. Let X be the number 
of f-tuples (oi, . . . , a„) of distinct vertices of G so when {i,j} € E{H) then 
{ai,aj} € E{H). That is, X is a count of copies oi H \n G though these 
copies may have extra edges and a given copy may be multiply counted if 
H has automorphisms. Clearly 

E[X] = {n)^p' ~ nV = cyn^-" 

which is 

^(elogn)(A;i logc-l+o(l)) 



from the estimates above. We first require that 

ki log c > 1 

which assures that E[X] is a positive power of n^°^". The crucial calculation 
will be to show 

Var[X] = o{E[Xf) (Vl) 

/,From this, by Chebyschev's Inequality X > .99-E[X] (say) almost surely. 
True, X counts noninduced copies of H. But let X~^ be a count of all 
copies of any H^ consisting of H with one additional edge added. There are 
0(log^ n) choices of that edge and for a given choice the expected number of 
such copies is pE[X] so that E[X+] = O [log'^ n/n)E[X] = o{E[X]) and so 
by Markov's Inequality almost surely X~^ < E[X]/2, say. So almost surely 
there are more than .99£'[X] copies of H and fewer than .5E[X] total copies 
of graphs containing H and one more edge so therefore there is at least one 
copy of H with no additional edge, i.e., the desired induced copy. 

Hence it sufhces to show (VI). 

Remark. To illustrate the complexities suppose we condition G{n,p) on 
a fixed copy of H^ and let Z be the expected number of extensions to H. 
The expectation argument above gives that E[Z] ~ [n^~^p'^y = (c^ /nY 
which is n ( s'^)_ However for there to be any extensions each of the at 
least 1/8 vertices of H~ that is supposed to have a path coming out of it 
must have at least one edge besides those of H^ . Any particular vertex fails 
this condition with probability e~'^ and these events are independent so that 
the probability that Z j^ is bounded from above by (1 — e~^) '^ which is 
polynomially small. This illustrates that the expected number of thingees 
being large does not a priori guarantee that almost surely there is a thingee. 

Of course, (VI) is equivalent to showing E[X'^] ~ £'[X]^. By the sym- 
metry of copies, £^[X^] is E[X] times the expected number of copies of H 
given the existence of a particular copy of H. Let 

V = {l,...,m} 

and let us specify a particular copy of H on vertex set V with 1, . . . , 3w — 1 
being the vertices of H~ . Let G* = G*{n,p) be the random graph on 
vertex set l,...,n where for i,j G V and {i,j} € E{H) we specify that 
{i,j} G E{G) but all other pairs i,j are adjacent in G with independent 
probabilities p. (Note that even those i,j with i,j G V but {i,i} E{H) 



have probability p of being in G* .) Let £^*[X] denote the expectation of X 
in G* . Then it suffices to show 

E*[X]'^E[X] (¥2) 

5.2 The Core Calculation: Expectation for Paths 

We shah work up to i? * [X] in stages. Let Ps{a,b) denote the expected 
number of paths of length s between vertices a, b, with the graph distri- 
bution G*{n,p). (As a benchmark note that in G{n,p) this expectation 
would be (n — 2)s-ip^ ~ n^~^p^.) Ps{a, b) is simply the sum over all tuples 
(ao, . . . ,as) with oq = a,as = b of distinct vertices of G of p" where a is 
the number of edges of the path ao • • • Os which are not in H. Let Pf (a, b) 
denote the expected number of such paths where we further require that a 
is not adjacent to ai in H. (When a ^ V these are the same.) We shall de- 
fine inductively Xs,x~ which provide upper bounds to Ps{a, b) and P^{a, b) 
respectively under the further assumption that b ^ V. (We shall see that 
P^{a, b) is dominated by paths which do not overlap H but that for Ps{a, b) 
there is a contribution from those paths which are paths in H for their initial 
segment.) Clearly we may set xi = x^ = p. Let Xg, x~ satisfy the following: 



Xs 


= pnXg_i + pmxs-i 




s-l 


Xs 


= x- + }] 50kx;_^ 



k=l 

We claim such Xs, x~ provide the desired upper bounds. To bound Pf{a, b) 
split paths aai ■ ■ ■ a^-ift according to ai £ V {< m possibilities) and ai ^ V 
(< n possibilities). Note we are excluding the case where a, ai are adjacent 
in H. For a given ai the expected number of paths is pPs^i{ai,b) (as we 
must have a,ai adjacent). When ai ^V this is by induction at most px^_-^ 
and when ai £ V this is by induction at most pXg^i so P~(a,b) < x~ by 
induction. Bounding Ps(a,b) is a bit more complex. Those paths for which 
a, ai are not adjacent in H contribute at most xj by induction. Otherwise, 
let k be the least integer for which a^, a^+i are not adjacent in H. (As b ^V 
this is well defined and 1 < k < s.) We pause for a technical calculation. 

We claim that in H for any k < w there are at most 50/c paths of length 
k beginning at any particular vertex v. Suppose a € H~ . There are at most 
four such paths staying in H~ . Once leaving H~ the path is determined 
(since critically k < w, the path length) and there are at most 8k ways of 

10 



determining when and how to leave H~ . The argument with a H~ is 
similar, we omit the details. Of course 50/c is a gross overestimate but we 
only use that it is a 0{h) bound. 

Back to bounding Ps{a,b). For a given k there are at most 50k choices 
for ai- • • ak and fixing those there is a contribution of P~_i^{ak, b) < xj_^ to 
Ps{a,b). Thus Ps{a,b) < Xg by induction. 

Now to bound the values Xs,xJ given by the inductive formulae. Let L 
be fixed (dependent only on c) so that 



oo 



L>l + J2 50A:c 



fe=i 
and set 



X- =p'n'-^{l + L- 



n 
We claim Xg < Xg and x'^ < X^ . For this we merely check (recall pn = c) 

s-l s-l 

Xi + Y, 50A:X7_;- < (1 + L—)p'n'~\l + ^ 50A;c-^) < Lp'n'-^ = Xg 
k=i "■ fc=i 

as Lms/n = o(l) and that 

pnX'^_^ + pmX g_i = X~ 
Thus we have shown 

when b ^V and further 

P«,(a,6)<n— V[l + 0(i^)] 

n 

when a, 6 y. 

Now (thinking of a, 6 € V) we seek a general bound y^ for Pg{a, b). We 
set 2/1 = 1 (as perhaps a, b are adjacent in i?) and define inductively 

ys = 3 + pnxg-i + pmyg-i + 50sp + ^ 50A;[pna:s_fc_i + pmyg_k-i] 

fe=i 

We claim Ps(a, 6) < y^ for 1 < s < w. Of the potential paths aai • • • ag^ib 
there are at most three which are paths in H and they contributes at most 

11 



three. There are less than 50s cases where aai ■ ■ ■ as-i is a path in H but 
as_i,6 are not adjacent in H and they each contribute p. The cases with 
ai ^V contribute at most pnxg-i- The cases with ai (zV but not adjacent 
to a in H contribute at most pmys-i- Otherwise let 1 < /c < s — 2 be 
the least k so that 0^,0^+1 are not adjacent in H. There are at most 50A; 
choices of ai • • • a^. Then there are at most n choices of ak+i V and each 
contributes pXg-k-i and at most m choices of Ofe+i S V and each contributes 

PVs-k-l- 

Now fix a constant M satisfying 

M > Mi = L[l + J2 SOfcc"^'] 

k=l 

We claim that for 1 < s < w 

Ps{a,b) <A + Mp'n'~^ 
By the previous bounds on Xg we bound 

s-2 

pnxg-i + yj 50kpnXs-k-i < Mip^n^~ 
k=l 

We bound 3 + 50sp < 3.01. By induction we bound 

s-2 

pmys-i + ^50kpmys-k^i < 50s'^pm[A+Mp^n^'^] < .01 + (M-Mi)p'*n'*"^ 

k=l 

since BOs'^pm = 0(log n/n) = o(l), completing the claim. We are really 
interested in the case s = w. Note p^n^^^ = c^ /n is asymptotically a 
positive power of n by the choice made of ki earlier. Thus the +4 may be 
absorbed in M and we have that 

P^{a,b) < Mp'^n'"-^ 

for all a, b while ii a,b ^V then we have the better bound 

n 
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5.3 Expectation of Copies of H 

Now we turn to the full problem of bounding £J*[X]. Recall we have labelled 
H so that 1, . . . , 3w—l are the vertices of H~ . Recall I denotes the number of 
tf-paths in going from H^ to H and recall I ~ elogn. £'*[X] is the sum over 
all m-tuples (ai, . . . , am) of distinct vertices of the probability (in G*{n,p)) 
that these as (in this order) give a copy of H. For each oi, . . . ,03^1,-1 the 
contribution of ?7i-tuples with this start is bounded from above by 



P 



Mp'^n^-^ 



l-A 



p'^n^-^l + Oi 



In n. 



n 



Here a is the number of adjacencies i,j in H~ with ai,aj not adjacent in 
H. A is the number of pairs i, j in H~ which are joined in i? by a u7-path 
and for which neither ai nor Oj is in y. I — A is then the remaining number 
of pairs i, j in H^ joined in /iT by a w-path. 

To see this note that for fixed ai, . . . , 03^-1 and any choice of ty-paths 
Pi, . . . ,Pi that are vertex disjoint the probability that they are all paths in 
G is simply the product of the probabilities for each path. Adding over all 
Pi, . . . ,Pi is then at most the product over j of adding the probabilities for 
each Pj, and these are precisely what the bracketed terms bound. The p", 
of course, is the probability that the Oj have the proper edges of H^ . 

Now we split the contribution to £^[X*] into two classes. First consider 
all those oi, . . . , 03^-1 with no ai € V. There are at most n^^~^ such tuples 
and each gives p^^ with 3w being the number of edges in H^ . For each 
yl = / so this gives a 



p'^n'"-^[l + 0{ 



In^n 



n 



)] 



factor. As / = O(lnn) 



[l + 0( )]' = 1 + 0(1) 



n 



so this entire contribution is asymptotic to 72^"'^"'^+ '^^^^p^"*^ which is 
asymptotic to -©[X], the expectation in G{n,p). That is, the main contri- 
bution (among the ai ■ ■ ■ a3«,-i that don't overlap V) to £'*[X] is by those 
copies of H that don't overlap H at all. To show (V2) it now suffices to 
show that the remaining contributions to £'*[X] are o{E[X]). 

There are 0(ln n) choices of a pair i < 3w — 1 and Oj G V. Fix such 
a pair and consider the contribution to £'*[X] with Oj this fixed value. In 
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H~ fix a cycle C of length 2tf that i lies on. The expected number of 
cycles of length 2w in G through a^ bounded from above by P2w{o,i,ai) 
which we've shown is at most Mn'^'^~^p'^'^. (The analysis done for Pg for 
1 < s < w extends with no change to s = 2w.) Let Q, Q' be the points of 
H~ of degree three. Fixing such a cycle the points aq^aqi are now fixed 
and the expected number of paths of length w between them is at most 
Pwiaq, ciQi) < Mn^^^p"^. Together the expected number of extensions of Oj 
to a copy of H~ is less than 0{n^^~'^p^^), off' from the expected number of 
copies of H~ by a factor of n~^^°^^' The 0(ln n) factor of the choices of i, ai 
can be absorbed into to o(l) so that the expected number of copies of H~ 
overlapping V is still only n^^+°y^> times the expected number of copies of 
H~ in G{n,p). Now given a copy oi, . . . , a^w-i of H^ the expected number 
of extensions to H in G*{n,p) is at most M times what it is in G{n,p), the 
extreme case when all A = 0, e.g., all Ui G V. Thus the total contribution 
to £'*[X] from copies in which H~^ overlaps V is at most 

Recall / ~ elogn. Up to now all constants ki,L,M have depended only on 
c and not e. Now (and formally this is at the very start of the proof, in the 
definition of H) we fix K so large that e is so small so that 

e(logM) < 1 

This assures that n~^^°^^' M is n to a negative power. Thus this contribu- 
tion to E*[X] is only o{E[X]). Hence E*[X] ~ E[X] which concludes the 
argument. 

5.4 Clean Topological fc-Cliques 

For the proof of Theorem 6 we require that for every c > 1 and every integer 
k that G{n, c/n) almost surely contains a CTKk- Fix c, k. We fix a real ki 
with 

ki log c > 1 

Set w = [feilnn]. We define H = H{k,ki,n) to consist of k "special" 
points and between each pair of special points a path of length w. Set 
V = k + (^){w — 1), the number of vertices and t = (2) — k, e = v + t so 
that e is the number of edges. Note e,v = (^1(2) + o(l))logn. We show 
that almost surely G{n, c/n) contains a copy of H. As the argument is very 
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simpler (and simpler) than that just given, we shall give the argument in 
outline form. Letting X denote the number of copies of H we have 

E[X] = {n)^p^ ~ nV = c^n-^ = n'=i(2)l°S'=-*+''(i) 

which is a positive power of n. Now we need show i?*[X] ~ E[X] where 
E* [X] is the expected number of copies of H conditioning on a fixed copy 
of H. Let us specify the fixed copy to be on vertex set F = {1, . . . , w} with 
l,...,fc being the special vertices and let G*{n,p) be G{n,p) conditioned 
on this copy. As before we let P«,(a, b) be the expected number of paths of 
length w between a, 6 in G*. Then, as before, there is a constant M so that 

for all a,b while 

PUa,b)=n^~'p^{l + o{l)) 

if either a or 6 is not in V. (We will not need the more precise error bound 
for this problem.) 

We split the contribution to £'*[X] into two groups. The ai, . . . , a^ which 
do not overlap V contribute 



n'' 



n'^-'p'^il + oil)) 



to E* [X] . Since k is fixed this is asymptotically n'"p^ which is asymptotically 
i?[X]. There are only n ~^°(^> different ai, . . . , Ofc which do overlap V. For 
each the contribution to E* [X] is at most 

Since M and k are constants this only a constant times the contribution 
to -E[X]. Thus the total contribution from these intersecting ai, . . . , a^ is 

n'^+°WE[X] and thus E*[X] ~ E[X] as required. 



References. 

1. J. Lynch, Probabilities of Sentences about Very Sparse Random Graphs, 
Random Structures and Algorithms (to appear) 



15 



